Sequence data mining
نویسنده
چکیده
Many interesting real-life mining applications rely on modeling data as sequences of discrete multi-attribute records. Existing literature on sequence mining is partitioned on application-specific boundaries. In this article we distill the basic operations and techniques that are common to these applications. These include conventional mining operations like classification and clustering and sequence specific operations like tagging and segmentation. We review state of the art techniques for sequential labeling and show how these apply in two real-life applications arising in address cleaning and information extraction from websites.
منابع مشابه
High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences
Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...
متن کاملComparison of Efficient Algorithms for Sequence Generation in Data Mining
Data mining is the method or the movement of analyzing data from different perspectives and summarizing it into useful information. There are several major data mining techniques that have been developed and are used in the data mining projects which include association, classification, clustering, sequential patterns, prediction and decision tree. Among different tasks in data mining, sequenti...
متن کاملApproaches for Pattern Discovery Using Sequential Data Mining
In this chapter we first introduce sequence data. We then discuss different approaches for mining of patterns from sequence data, studied in literature. Apriori based methods and the pattern growth methods are the earliest and the most influential methods for sequential pattern mining. There is also a vertical format based method which works on a dual representation of the sequence database. Wo...
متن کاملAdvances in Classification of Sequence Data
In recent years, advanced information systems have enabled a collection of increasingly large amounts of data that are sequential in nature. To analyze huge amounts of sequential data, the interdisciplinary field of knowledge discovery in databases (KDD) is very useful. The most important step within the process of KDD is data mining, which is concerned with the extraction of the valid patterns...
متن کاملSequence Mining for Customer Behaviour Predictions in Telecommunications
Predicting the behaviour of customers is challenging, but important for service oriented businesses. Data mining techniques are used to make such predictions, typically using only recent static data. In this paper, a sequence mining approach is proposed, which allows taking historic data and temporal developments into account as well. In order to form a combined classifier, sequence mining is c...
متن کاملSequential Sequence Mining Technique in Mammographic Information Analysis Database
The Sequential Sequence Mining produces large sequences of biomedical data. It provides the opportunities for data analysis and knowledge discovery. The Sequential Sequence Mining provides the efficient and scalable methods to extract the sequences of interest in datasets. The synthetic dataset was taken of the medical images of mammography. The Sequential Sequence Mining technique motivates us...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004